A Multi-label Classification on Topic of Hadith Verses in Indonesian Translation using CART and Bagging

نویسندگان

چکیده

Hadith is a source of law for Muslims after the al-qur'an, in which there are instructions form words, actions, attitudes, and others. must be studied practiced by Muslims, then used as way life al-qur'an. Classifying hadith to make it easier learn looking at text pattern translation Bukhari based on three classes or categories suggestions, prohibitions, information. The classification carried out multi-label classification. process uses N-gram TF-IDF feature extraction, CART bagging methods, hamming loss evaluation methods. Bagging cover shortcomings CART, namely, model less stable, which, if slight change training data, will have significant effect resulting learning model. Several testing methods were obtain best hammer value this study. Based several tests that been out, 0.1914 80.86%. These results indicate use can help increase accuracy 5%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Label Classification from Multiple Noisy Sources Using Topic Models

Multi-label classification is a well-known supervised machine learning setting where each instance is associated with multiple classes. Examples include annotation of images with multiple labels, assigning multiple tags for a web page, etc. Since several labels can be assigned to a single instance, one of the key challenges in this problem is to learn the correlations between the classes. Our f...

متن کامل

on translation of politeness strategies in dialogues involving female characters in translations and retranslations of novels translated before and after the islamic revolution of iran and their effects on the image of women: a polysystem theory approach

abstract reception environment has considerable effects on accepting a translation. as the expectations of a target culture and its values and needs change throughout history, its criteria for accepting a translation or rejecting it will change accordingly (gentzler, 2001). the expectations of iran, as the reception environment in the present study, have changed after the islamic revolution. i...

Topic Modeling and Classification of Cyberspace Papers Using Text Mining

The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...

متن کامل

reflections on taught courses of the iranian ma program in english translation: a mixed-methods study

the issue of curriculum and syllabus evaluation and revision has been in center of attention right from when curriculum came into attention of educational institutions. thus everywhere in the world in educational institutions curricula and syllabi are evaluated and revised based on the goals, the needs, existing content, etc.. in iran any curriculum is designed in a committee of specialists and...

Exploiting Associations between Class Labels in Multi-label Classification

Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jurnal media informatika Budidarma

سال: 2022

ISSN: ['2548-8368', '2614-5278']

DOI: https://doi.org/10.30865/mib.v6i2.3787